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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/619,032 
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input Set : N:\Crf3\RULE60\09619032.txt 
Output Set: N:\CRF3\09272001\l619032.raw 

SEQUENCE LISTING 
1) GENERAL INFORMATION: 

(i) APPLICANT: Murphy, Dennis 

Reid, John 

(ii) TITLE OF INVENTION: ALPHA - G ALACTOS I DASE 

(iii) NUMBER OF SEQUENCES: 4 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson, P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 92037 
(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/619,032 

(B) FILING DATE: 19-Jul-2000 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/407,806 

(B) FILING DATE: 

(Viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Haile, Ph.D., Lisa A. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 09010/004001 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-678-5070 

(B) TELEFAX: 619-68-5099 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO : 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGAGCG CTCGTCTTTC AC 
(2) INFORMATION FOR SEQ ID NO : 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 




52 
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«. xrcrrTxrr DATE : 09/27/2001 

RAW SEQUENCE LISTING 15*42:46 
PATENT APPLICATION: US/09/619 , 032 TIME. 1* . ^ 

input Set : N:\Crf3\RULE60\09619032.txt 
Output Set: N:\CRF3\09272001\l619032.raw 

67 (D) TOPOLOGY: linear 

6 9 (ii) MOLECULE TYPE: CDNA 

71 xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 31 

7 3 CGGAAGATCT AGGTTCCCCA TTTTCACCCC T 
7 5 (2) INFORMATION FOR SEQ ID NO: 3: 

77 (i) SEQUENCE CHARACTERISTICS: 

78 (A) LENGTH: 1041 base pairs 

79 (B) TYPE: nucleic acid 

80 (C) STRANDEDNESS : single 

81 (D) TOPOLOGY: linear 

83 (ix) FEATURE: 

84 ( A) NAME/KEY: Coding Sequence 

85 (B) LOCATION: 1...1038 

86 (D) OTHER INFORMATION: 

88 _.<?i> S°2?jrS5"« ^?i?cE ! cJi TAT GCC GAA ATC CCA 48 

AA 
Ly 
25 



^ »™ rrr r<vc CTC TTT CAC GGC AAC CTC CAb tai ^ « — 

90 TTG AGA GCG CTC GTC TTT CAL ^ ^ ^ ne prQ 

91 Leu Arg Ala Leu Val Pne axt> ^ 15 

92 1 5 ATV ~ nrA m Ar > ATC CCA GTC ATC GAG 96 

94 AAG AGC GAA CCA AAG GTC ATA GAG AAG GCA TAC ATC CCA ^ ^ 

95 Lys Ser Glu Pro Lys Val He Glu Lys Ala lyr ^ 

96 2 L ™» paa PPT TTT GGG CTC AAC ATA ACG GGC TAT ACC 144 

98 ACA CTG ATT AAA GAA GAA CCT TTT GGG CT ^ ^ ^ ^ 

99 Thr Leu He Lys Glu Glu Pro fne ^ 

100 35 „„ nnr aap rAT ATT ATA CTC GTT AAA GGG GGC ATC GCG 192 
\l] S Zl S S Pro £ E S ne M »1 g. Oly =ly U. -a 

i!l ACT GAC CTG ATA G f «» « « - f C TAG ACG CCA ATA CTC OCC M . 

107 Ser Asp Leu He Glu lie lie Gly Thr ser lyr 8o 

108 65 70 _ _, Aa rTT rAa AGA GAT AGG GTT 288 

110 CTC CTG CCG CTT AGC AGA GTA GAA GCA CAA GTT CAG AGA ^ 

111 Leu Leu Pro Leu Ser Arg Val Glu Ala Gin vai ^ 

112 „ r*A CAC CTC TTC GAG GTT TCT CCA AAG GGA TTC TGG CTG CCA GAG 336 

114 AAG GAA GAG CTC TTC GAG bi Leu prQ Glu 

115 Lys Glu Glu Leu Phe Glu Val Ser Pro i*ys v» y ^ 

116 atp ppt rrc ATA CTG AAG GAC AAC GGT TAT GAG 384 

118 *f Aso Pro fie lie So aS lie Leu Lys Asp Asn Gly Tyr Glu 

119 Leu Ala Asp Pro lie ne t-j-u n*. 

120 115 „„„ \^. n ™ T TTr TCA gct CAT CTC AAC TCG 432 

122 TAT CTA TTC GCC GAC GAG GCG ATG CTT TTC TCA GCT ^ 

123 Tyr Leu Phe Ala Asp Glu Ala Met Leu Phe ser jia 

1 24 130 . ™n CAC CTT ATA AAG GCC CAA AGG 480 

126 GCG ATA AAG CCA ATT AAA CCG CTC CCA CAC CTT ATA 

127 Ala lie Lys Pro He Lys Pro Leu Pro His Leu lie y ^ 

128 145 ^ lln a tp arr TAT CTC CTT CTC AGG GAG CTT AGG 528 

130 GAA AAG CGC TTT AGG TAC ATC AGC TAT CTC C^ ^ ^ ^ ^ 

131 Glu Lys Arg Phe Arg Tyr lie sex ±y 1?5 

i 3 3 2 4 AAG GCG ATA AAG CTC GTT TTT GAA GGT AAG GTA ACG CTA AAG GTC AAA 576 
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RAW SEQUENCE LISTING DATE : J9/27/2001 

PATENT APPLICATION : US/09/619,032 TIME: 15:42:46 

Input Set : N:\Crf3\RULE60\09619032.txt 
Output Set: N:\CRF3\09272001\I619032.raw 

135 Lys Ala He Lys Leu Val Phe Glu Gly Lys Val Thr Leu Lys Val Lys 

Is z s s s s s s s s s s is s s s s 624 

1" CTC ATC GGA AGG CTT CCT CTT ATG AAT CCT AAG AAA GTG GOG AGO TGG 672 
143 Leu lie Gly Arg Leu Pro Leu Met Asn Pro Lys Lys Val Ala Ser Trp 

i 4 46 ATA GAG GAC AAG AAC ATT IH CTA TAC GGC ACC GAT ATA GAG TTC ATT 720 
x^o Hin r Thr Asp Ile Glu pne lie 

147 Ile Glu Asp Lys As 



230 235 



CCC TAT AGG GAC ATT GCA GGC AGA ATG ACT GTT GAG GGA TTA TTA GAG 
G1 Y T T yr Arg Asp lie Ala Gly Arg Met Ser Val Glu Gly Leu Leu Glu 

GTT ATA GAC GAG CTC AAC TCG GAA CTG TGC CCC TCA GAG CTG AAG CAC 
val lie Asp Glu Leu Asn Ser Glu Leu Cys Pro Ser Glu Leu Lys His 



TTG AGG ATA TGG AGA GAG GAC GAA GGG AAC GCA AGA CTT AAT ATG CTG 
lit Arg lie Trp Arg Glu Asp Glu Gly Asn Ala Arg Leu Asn Met Leu 

TAC AAT ATG AGG GGC GAA CTC GCC TTT TTA GCC GAG AAC AGC GAT GCA 
TAC AA1 ftl<^ lu Asn Ser Asp Ala 

Tyr Asn Met Arg Gly Glu Leu aw ^ c 320 



i 9 96 Thr Leu lie Lys Glu Glu Pro Phe sly Leu Asn Ile Thr Gly Tyr Thr 

1 9 98 Leu Lys Phe Leu Pro Lys Asp !!e He Leu Val Lys Gly Gly lie Ala 

55 ou 

199 Tn /-» -i Tla -mo riv Thr Ser Tyr Thr Ala He Leu Pro 

200 Ser Asp Leu He Glu Ile He Gly Tnr ber iyx 



768 



148 225 
150 
151 

152 245 __ _ tVL <vn* r,ar, fTr, AAG CAC 816 

154 

I s S 5 £ S - ™ S S = = 5 S S S = 864 

160 275 ^ A PTT AAT ATG CTG 912 

162 

163 J-Jtiu my -i--^ --ir — =» - 3QQ 

164 290 295 m _ „„„ n _^ n _ AAP Anr GAT GCA 960 

166 

£ GGA TGG GCC CTC SJ «. *» «0 CTG GAT GCC TTC CGG GCG ATA 1008 

171 



£ SI S Pro £ Pro Glu « g Ar, L«» Asp »!. Phe Ar g Ala He 

172 325 1041 

174 TAT AAC GAT TGG AGG GGT AAT GGG GAA CCT TAG 

175 Tyr Asn Asp Trp Arg Gly Asn Gly Glu Pro 

176 340 345 
17 9 (2) INFORMATION FOR SEQ ID NO: 4: 

181 (i) SEQUENCE CHARACTERISTICS: 

182 (A) LENGTH: 346 amino acids 
1 Q3 (B) TYPE: amino acid 
184 (D) TOPOLOGY: linear 
186 (ii) MOLECULE TYPE: protein 
18 8 (v) FRAGMENT TYPE: internal 
190 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

192 Leu Arg Ala Leu Val Phe His Gly Asn Leu Gin Tyr Ala Glu lie Pro 

III lJs Ser Glu Pro Lys Val lie Glu Lys Ala Tyr lie Pro Val He Glu 

2 5 



file://C:\CRF3\Outhol(KVsrI619032.htm 



9/27/01 



203 




RAW SEQUENCE LISTING ™TE: f/f/ 2 ™ 1 

PATENT APPLICATION : US/09/619,032 TIME: 15:42:46 

input Set : N:\Crf3\RULE60\09619032.txt 
Output Set: N:\CRF3\09272001\l619032.raw 



201 65 



70 



75 8° 



202 Leu Leu Pro Leu Ser Arg Val Glu Ala Gin Val Gin Arg Asp Arg Val 



85 



206 Leu Ala Asp III He lie Pro Ala fie Leu Lys Asp Asn Gly Tyr Glu 

III Tyr Leu III Ala Asp Glu Ala Met Leu Phe Ser Ala His Leu Asn Ser 

III Ala 111 Lys Pro He Lys III Leu Pro His Leu lie Lys Ala Gin Arg 

mi IAS 150 

212 Glu Lys Arg Phe Arg Tyr lie Ser Tyr Leu Leu Leu Arg Glu Leu Arg 

214 Lys Ala lie Lys Leu Val Phe Glu Gly Lys Val Thr Leu Lys Val Lys 

180 185 

216 Asp lie Glu Ala Val Pro Val Trp Val Ala Val Asn Thr Ala Val Met 

III Leu lie Gly Arg Leu Pro Leu Met Asn Pro Lys Lys Val Ala Ser Trp 

219 210 
220 

221 225 



lie Glu Asp Lys Asn lie Leu Leu Tyr Gly Thr Asp He Glu Phe lie 

& — J 



230 



222 Gly Tyr Arg Asp He Ala Gly Arg Met Ser Val Glu Gly Leu Leu Glu 



245 



224 Val lie Asp Glu Leu Asn Ser Glu Leu Cys Pro Ser Glu Leu Lys His 

2 6 5 



225 260 



Ser Gly Arg Glu Leu Tyr Leu Arg Thr Ser Ser Trp Ala Asp Lys Ser 



Leu Arg He Trp Arg Glu Asp Glu Gly Asn Ala Arg Leu Asn Met Leu 



226 

227 275 
228 

229 290 
230 

231 305 



280 



290 295 JUU 

Tyr Asn Met Arg Gly Glu Leu Ala Phe Leu Ala Glu Asn Ser Asp Ala 

232 Arg Gly Trp Pro Leu Pro Glu Arg Arg Leu Asp Ala Phe Arg Ala He 



233 



234 Tyr Asn Asp Trp Arg Gly Asn Gly Glu Pro 



235 



340 345 
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VERIFICATION SUMMARY , DATE : 09/27/2001 

PATENT APPLICATION: US/09/619,032 TIME: 15:42:47 

Input Set : N:\Crf3\RULE60\09619032.txt 
Output Set: N:\CRF3\09272001\l619032.raw 

Kevword misspelled or invalid format, [(1) GENERAL INFORMATION: ] 
KeTSord misspelled or invalid format, [(ii) TITLE OF INVENTION : ] 
Z misspelled or invalid format, [(A) APPLICATION NUMBER 
Keyword misspelled or invalid format, [(B) FILING DATE:] 
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